ASRS-CMFS vs. RoBERTa: Comparing Two Pre-Trained Language Models to Predict Anomalies in Aviation Occurrence Reports with a Low Volume of In-Domain Data Available

Abstract

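We consider the problem of solving Natural Language Understanding (NLU) tasks characterized by domain-specific data. An effective approach consists of pre-training Transformer-based language models from scratch using domain-specific data before fine-tuning them on the task at hand. A low volume of domain-specific data is problematic in this context, given that the performance of language models relies heavily on the abundance of data during pre-training. To study this problem, we create a benchmark replicating realistic field use of language models to classify aviation occurrences extracted from the Aviation Safety Reporting System (ASRS) corpus. We compare two language models on this new benchmark: ASRS-CMFS, a compact model inspired by RoBERTa, pre-trained from scratch using only little domain-specific data, and the regular RoBERTa model, with no domain-specific pre-training. The RoBERTa model benefits from its size advantage, while ASRS-CMFS benefits from the pre-training from scratch strategy. We find no compelling statistical evidence that RoBERTa outperforms ASRS-CMFS, but we show that ASRS-CMFS is more compute-efficient than RoBERTa. We suggest that pre-training a compact model from scratch is a good strategy for solving domain-specific NLU tasks using Transformer-based language models in the context of domain-specific data scarcity.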

Journal

Journal title: Aerospace

Year: 2022

ISSN: 2226-4310

DOI: https://doi.org/10.3390/aerospace9100591